AMENDMENTS TO THE CLAIMS



Listing of Claims: 

1. (currently amended) A method of handling data for
determining similarity between portions of gene expression 
profiles in a computer comprising the steps of: 
processing a number of gene expression profiles with a similar 
sequences algorithm that is a time and intensity invariant 
correlation function to obtain a data set of gene expression
profile pairs and a match fraction for each gene expression
profile pair;

listing gene expression profile pairs in clusters by their 
match fractions; 

removing a first gene expression profile from a cluster when 
another cluster has another gene expression profile with a 
higher match fraction with the first gene expression profile, 
unless the another gene expression profile requires a larger 
number of subsequences to achieve similarity with the first 
gene expression profile; 

repeating the removing step until all gene expression profiles
are listed in only one cluster; 

providing output of the listing of clusters of gene 
expression profiles. 
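
Editorial illustration (not part of the claim language): the removing and repeating steps of claim 1 can be sketched as follows. This is a minimal sketch under stated assumptions, not the applicant's implementation. The names `best_match` and `resolve_memberships`, the `pairs` dictionary keyed by `frozenset` and holding `(match_fraction, n_subsequences)` tuples, and the reading of the "unless" clause as "the competing pair uses more subsequences" are all hypothetical.

```python
def best_match(gene, cluster, pairs):
    """Return (match_fraction, n_subsequences) for the gene's best partner in a cluster.

    `pairs` maps frozenset({a, b}) -> (match_fraction, n_subsequences); this
    layout is an assumption made for the illustration.
    """
    best = (0.0, 0)
    for other in sorted(cluster):  # sorted only to keep tie handling deterministic
        if other == gene:
            continue
        info = pairs.get(frozenset((gene, other)))
        if info is not None and info[0] > best[0]:
            best = info
    return best


def resolve_memberships(clusters, pairs):
    """Repeat the removing step until every gene is listed in only one cluster."""
    clusters = [set(c) for c in clusters]
    changed = True
    while changed:
        changed = False
        for gene in sorted(set().union(*clusters)):
            homes = [i for i, c in enumerate(clusters) if gene in c]
            if len(homes) < 2:
                continue
            keep = homes[0]
            for cand in homes[1:]:
                k_frac, k_subs = best_match(gene, clusters[keep], pairs)
                c_frac, c_subs = best_match(gene, clusters[cand], pairs)
                # Move only when the other cluster offers a higher match fraction
                # and does not need more subsequences (assumed reading of "unless").
                if c_frac > k_frac and not c_subs > k_subs:
                    keep = cand
            for i in homes:
                if i != keep:
                    clusters[i].discard(gene)
            changed = True
    return [c for c in clusters if c]
```

With clusters {A, B, C} and {C, D}, gene C stays with A and B when its stronger match to D relies on more subsequences, and moves to D's cluster otherwise.
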
2. (currently amended) A method of handling data for
determining similarity between portions of gene expression
profiles comprising the steps of:

processing a number of gene expression profiles with a similar
sequences algorithm that is a time and intensity invariant
correlation function to obtain a data set of gene expression
pairs and a match fraction for each pair;
listing gene expression pairs in clusters by their match
fractions;

removing a first gene from a first cluster when the first gene 
is also in a second cluster which has another gene with a
higher match fraction with the first gene than any of the genes
in the first cluster have with the first gene, but; 
retaining the first gene in the first cluster and removing the 
first gene from the second cluster when the difference between 
the highest match fraction of the first gene with a gene in the 
first cluster and the highest match fraction of the first gene 
with a gene in the second cluster is less than a minimum 
difference threshold and the number of subsequences represented 
in the similar gene pair having the highest match fraction in 
the first cluster is higher than the number of subsequences 
represented in the similar gene pair having the highest match 
fraction in the second cluster; 

repeating the removing step until all genes are listed in only 
one cluster; 

providing output of the listing of clusters of gene 
expression profiles. 
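
Editorial illustration (not part of the claim language): the removing and retaining conditions of claim 2 reduce to a single decision per gene that appears in two clusters. The sketch below assumes each cluster is summarized by a hypothetical `(highest_match_fraction, n_subsequences)` tuple for the gene in question; the function name `choose_cluster` and the parameter `min_diff` (the minimum difference threshold) are illustrative names only.

```python
def choose_cluster(first, second, min_diff):
    """Decide which cluster keeps a gene that is listed in two clusters.

    `first` and `second` are (highest_match_fraction, n_subsequences) tuples
    describing the gene's best pair in each cluster (assumed representation).
    Returns "first" or "second".
    """
    f_frac, f_subs = first
    s_frac, s_subs = second
    if s_frac > f_frac:
        # Retaining step: a small gap in match fraction is overridden when the
        # first cluster's best pair is supported by more subsequences.
        if (s_frac - f_frac) < min_diff and f_subs > s_subs:
            return "first"
        # Removing step: otherwise the higher match fraction wins.
        return "second"
    return "first"
```

For example, with a minimum difference threshold of 0.05, a gene whose best pairs are (0.80, 5 subsequences) in the first cluster and (0.82, 3 subsequences) in the second stays in the first cluster, while a second-cluster match of 0.95 would move it.
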
3. (currently amended) A method of determining similarity
between portions of gene expression profiles comprising the 
steps of: 

processing data embodying a number of gene expression
profiles with a similar sequences algorithm in a computer that
is a time and intensity invariant correlation function to 
obtain a data set of gene expression pairs and a match fraction 
for each pair; 

choosing a threshold match fraction; 

listing gene expression pairs in clusters by their match 
fractions above the threshold;
adding each gene not already in a cluster to a cluster having
another gene having a highest match fraction with the each gene
without regard of the threshold;
removing a first gene from a cluster when the first gene is 
also in another cluster which has another gene with a higher 
match fraction with the first gene than any of the genes in the 
cluster have with the first gene; 

repeating the removing step until all genes are listed in only 
one cluster; 

providing output of the listing of clusters of gene
expression profiles. 
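
Editorial illustration (not part of the claim language): the adding step of claim 3, in which a gene left out of every cluster joins the cluster holding its best-matching gene regardless of the threshold, might look like the sketch below. The function name `attach_unclustered`, the `match` dictionary keyed by `frozenset`, and the choice to compare only against the original cluster members are assumptions made for this example.

```python
def attach_unclustered(genes, clusters, match):
    """Add each unclustered gene to the cluster containing its best match.

    `match` maps frozenset({a, b}) -> match fraction (assumed layout).
    The threshold is deliberately ignored here, as in the adding step.
    """
    seeds = [frozenset(c) for c in clusters]   # snapshot of the original members
    result = [set(c) for c in clusters]
    clustered = set().union(*seeds)
    for gene in genes:
        if gene in clustered:
            continue
        best_frac, best_idx = -1.0, None
        for idx, members in enumerate(seeds):
            for other in sorted(members):
                frac = match.get(frozenset((gene, other)), 0.0)
                if frac > best_frac:
                    best_frac, best_idx = frac, idx
        if best_idx is not None:
            result[best_idx].add(gene)
    return result
```

A gene E whose best matches are 0.3 with A and 0.4 with C would be attached to the cluster containing C, even if both values fall below the chosen threshold.
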
4. (currently amended) A method of handling data for
determining similarity between portions of gene expression
profiles comprising the steps of: 

processing a number of gene expression profiles with a similar 
sequences algorithm that is a time and intensity invariant
correlation function with a computer to obtain a data set of 
gene expression pairs and a match fraction for each pair; 
choosing a threshold match fraction; 

listing gene expression pairs in clusters by their match 
fractions above the threshold; 

adding each gene not already in a cluster to a cluster having
another gene having a highest match fraction disregarding the 
threshold with the each gene; 

removing a first gene from a first cluster when the first gene 
is also in a second cluster which has another gene with a 
higher match fraction with the first gene than any of the genes 
in the first cluster have with the first gene, but; 
retaining the first gene in the first cluster and removing the 
first gene from the second cluster when the difference between 
the highest match fraction of the first gene with a gene in the 
first cluster and the highest match fraction of the first gene 
with a gene in the second cluster is less than a minimum
difference threshold and the number of subsequences represented 
in the similar gene pair having the highest match fraction in 
the first cluster is higher than the number of subsequences 
represented in the similar gene pair having the highest match
fraction in the second cluster; 

repeating the removing and retaining steps until all genes are 
listed in only one cluster; 

providing output of the listing of clusters of gene 
expression profiles.
5. (currently amended) A method of handling data in a computer
for determining similarity between genes comprising the steps
of:

listing genes to be compared in a data set by their gene 
expression profiles; 

processing the listed gene expression profiles with a similar 
sequences algorithm that is a time and intensity invariant 
correlation function to obtain a data set of gene expression 
pairs and a match fraction for each pair; 
choosing a threshold match fraction; 

creating a set G in which to list indices of genes accounted
for;

assigning genes i and j to a cluster a if they have a match 
fraction greater than the threshold; 

assigning gene k to the cluster a if it has a match fraction 
greater than the threshold with either gene i or gene j; 
assigning genes k and l to a cluster b if they have a match
fraction greater than the threshold and if both gene k and gene
l do not have match fractions above the threshold with either
gene i or gene j;

repeating the assigning steps until all genes to be compared
have been considered; 

removing a first gene from a cluster when another cluster has 
another gene with a higher match fraction with the first gene; 
repeating the removing step until all genes are listed in only 
one cluster;

providing output of the listing of clusters of gene 
expression profiles. 
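
Editorial illustration (not part of the claim language): one way to read the assigning steps of claim 5 is as seed-based grouping. The sketch below is a hedged interpretation only. It assumes a `match` dictionary keyed by `frozenset`, processes pairs in descending order of match fraction (an ordering the claim does not specify), and treats "both gene k and gene l do not have match fractions above the threshold" as "neither gene is already accounted for". The function name `assign_clusters` is hypothetical; `accounted` plays the role of the set G.

```python
def assign_clusters(match, threshold):
    """Group genes into clusters from pairwise match fractions.

    `match` maps frozenset({a, b}) -> match fraction (assumed layout).
    Returns (clusters, accounted), where `accounted` corresponds to set G.
    """
    genes = sorted(set().union(*match)) if match else []

    def above(a, b):
        return match.get(frozenset((a, b)), 0.0) > threshold

    # Pairs above the threshold, strongest first (ordering is an assumption).
    strong = sorted(
        ((frac, tuple(sorted(pair))) for pair, frac in match.items() if frac > threshold),
        key=lambda item: (-item[0], item[1]),
    )
    accounted = set()
    clusters = []
    for _, (i, j) in strong:
        if i in accounted or j in accounted:
            continue  # only genes not yet accounted for may seed a new cluster
        members = {i, j}
        for k in genes:
            # Gene k joins when it matches either seed gene above the threshold.
            if k not in (i, j) and (above(k, i) or above(k, j)):
                members.add(k)
        clusters.append(members)
        accounted |= members
    return clusters, accounted
```

Under this reading a gene can land in more than one cluster (for instance, a gene matching a seed of cluster a and a seed of cluster b), which is why the removing step that follows in the claim is still needed.
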
6. (currently amended) A method of handling data in a computer
for determining similarity between genes comprising the steps 
of: 

listing genes to be compared in a data set by their gene
expression profiles;

processing the listed gene expression profiles with a similar 
sequences algorithm that is a time and intensity invariant
correlation function to obtain a data set of gene expression 
pairs and a match fraction for each pair; 
choosing a threshold match fraction; 

creating a set G in which to list indices of genes accounted 
for; 

assigning genes i and j to cluster 1 if they have a match 
fraction greater than the threshold; 

assigning gene k to cluster 1 if it has a match fraction 
greater than the threshold with either gene i or gene j; 
assigning genes k and l to cluster 2 if they have a match
fraction greater than the threshold and if both gene k and gene
l do not have match fractions above the threshold with either
gene i or gene j;

removing a first gene from a cluster when another cluster has 
another gene with a higher match fraction with the first gene, 
unless the another gene requires a larger number of 
subsequences to achieve similarity with the first gene; 
repeating the removing step until all genes are listed in only 
one cluster; 

providing output of the listing of clusters of gene
expression profiles. 



CHA9 2003 0003 US1
Serial No: 10/829,448 
Filed: July 29, 2003 



RCVD AT 2/27/2007 1:19:47 PM [Eastern Standard Time]



7. (canceled)

8. (canceled)

9. (canceled)



10. (previously amended) A program product having computer
readable code stored on a recordable media for determining 
similarity between portions of gene expression profiles 
comprising: 

programmed means for processing a number of gene expression 
profiles with a similar sequences algorithm that is a time and
intensity invariant correlation function to obtain a data set 
of gene expression pairs and a match fraction for each pair; 
programmed means for listing gene expression pairs in clusters 
by their match fractions; 

programmed means for removing a first gene from a cluster when 
the first gene is also in another cluster which has another 
gene with a higher match fraction with the first gene than any 
of the genes in the cluster have with the first gene; 
programmed means for repeating the removing step until all
genes are listed in only one cluster. 
11. (previously amended) A program product having computer
readable code stored on a recordable media for determining
similarity between portions of gene expression profiles using
output from a similar sequences algorithm that is a time and 
intensity invariant correlation function comprising: 
programmed means for providing a gene expression profile data 
set as input to programmed means embodying a similar sequences 
algorithm that is a time and intensity invariant correlation 
function to obtain a data set of gene expression pairs and a 
match fraction for each pair as output from the programmed 
means embodying a similar sequences algorithm; 

programmed means for listing the gene expression pairs in 
clusters by their match fractions; 

programmed means for removing a first gene from a cluster when 
the first gene is also in another cluster which has another 
gene with a higher match fraction with the first gene than any 
of the genes in the cluster have with the first gene; 
programmed means for repeating the removing step until all 
genes are listed in only one cluster. 






12. (previously amended) A program product having computer
readable code stored on a recordable media for determining 
similarity between portions of gene expression profiles 
comprising the steps of: 

programmed means for processing a number of gene expression 
profiles with a similar sequences algorithm that is a time and
intensity invariant correlation function to obtain a data set
of gene expression pairs and a match fraction for each pair;
programmed means for listing gene expression pairs in clusters 
by their match fractions; 

programmed means for removing a first gene from a first 
cluster when the first gene is also in a second cluster which 
has another gene with a higher match fraction with the first 
gene than any of the genes in the first cluster have with the 
first gene, but; 

programmed means for retaining the first gene in the first 
cluster and removing the first gene from the second cluster 
when the difference between the highest match fraction of the 
first gene with a gene in the first cluster and the highest 
match fraction of the first gene with a gene in the second 
cluster is less than a minimum difference threshold and the 
number of subsequences represented in the similar gene pair 
having the highest match fraction in the first cluster is 
higher than the number of subsequences represented in the 
similar gene pair having the highest match fraction in the 
second cluster;
programmed means for repeating the removing step until all
genes are listed in only one cluster.
13. (previously amended) A program product having computer
readable code stored on a recordable media for determining
similarity between portions of gene expression profiles
comprising the steps of:

programmed means for processing a number of gene expression
profiles with a similar sequences algorithm that is a time and
intensity invariant correlation function to obtain a data set 
of gene expression pairs and a match fraction for each pair; 
programmed means for choosing a threshold match fraction; 
programmed means for listing gene expression pairs in clusters 
by their match fractions above the threshold;
programmed means for adding each gene not already in a cluster
to a cluster having another gene having a highest match
fraction with the each gene without regard of the threshold; 
programmed means for removing a first gene from a cluster when 
the first gene is also in another cluster which has another 
gene with a higher match fraction with the first gene than any 
of the genes in the cluster have with the first gene; 
programmed means for repeating the removing step until all 
genes are listed in only one cluster. 



14. (previously amended) A program product having computer
readable code stored on a recordable media for determining
similarity between portions of gene expression profiles
comprising the steps of:

programmed means for processing a number of gene expression 
profiles with a similar sequences algorithm that is a time and 
intensity invariant correlation function to obtain a data set 
of gene expression pairs and a match fraction for each pair; 
programmed means for choosing a threshold match fraction;
programmed means for listing gene expression pairs in clusters
by their match fractions above the threshold;

programmed means for adding each gene not already in a cluster 
to a cluster having another gene having a highest match 
fraction disregarding the threshold with the each gene; 
programmed means for removing a first gene from a first 
cluster when the first gene is also in a second cluster which 
has another gene with a higher match fraction with the first 
gene than any of the genes in the first cluster have with the 
first gene, but; 

programmed means for retaining the first gene in the first 
cluster and removing the first gene from the second cluster 
when the difference between the highest match fraction of the 
first gene with a gene in the first cluster and the highest 
match fraction of the first gene with a gene in the second 
cluster is less than a minimum difference threshold and the 
number of subsequences represented in the similar gene pair
having the highest match fraction in the first cluster is 
higher than the number of subsequences represented in the 
similar gene pair having the highest match fraction in the 
second cluster;

programmed means for repeating the removing and retaining 
steps until all genes are listed in only one cluster. 
15. (previously amended) A program product having computer
readable code stored on a recordable media for determining
similarity between genes comprising the steps of:
programmed means for listing genes to be compared by their
gene expression profiles;

programmed means for processing the listed gene expression 
profiles with a similar sequences algorithm that is a time and 
intensity invariant correlation function to obtain a data set 
of gene expression pairs and a match fraction for each pair; 
programmed means for choosing a threshold match fraction; 
programmed means for creating a null set G(0) to hold genes 
accounted for; 

programmed means for assigning genes i and j to cluster 1 if 
they have a match fraction greater than the threshold; 
programmed means for assigning gene k to cluster 1 if it has a 
match fraction greater than the threshold with either gene i or 
gene j;

programmed means for assigning genes k and l to cluster 2 if
they have a match fraction greater than the threshold and if
both gene k and gene l do not have match fractions above the
threshold with either gene i or gene j; 

programmed means for removing a first gene from a cluster when 
another cluster has another gene with a higher match fraction 
with the first gene; 

programmed means for repeating the removing step until all
genes are listed in only one cluster. 
16. (previously amended) A program product having computer
readable code stored on a recordable media for determining
similarity between genes comprising the steps of: 
programmed means for listing genes to be compared by their 
gene expression profiles; 

programmed means for processing the listed gene expression 
profiles with a similar sequences algorithm that is a time and 
intensity invariant correlation function to obtain a data set 
of gene expression pairs and a match fraction for each pair; 
programmed means for choosing a threshold match fraction;
programmed means for creating a null set G(0) to hold genes
accounted for; 

programmed means for assigning genes i and j to cluster 1 if 
they have a match fraction greater than the threshold;
programmed means for assigning gene k to cluster 1 if it has a 
match fraction greater than the threshold with either gene i or 
gene j;

programmed means for assigning genes k and l to cluster 2 if
they have a match fraction greater than the threshold and if
both gene k and gene l do not have match fractions above the
threshold with either gene i or gene j; 

programmed means for removing a first gene from a cluster when 
another cluster has another gene with a higher match fraction 
with the first gene, unless the another gene requires a larger 
number of subsequences to achieve similarity with the first
gene;

programmed means for repeating the removing step until all 
genes are listed in only one cluster. 






17. (canceled)

18. (canceled)

19. (canceled)

20. (previously amended) In a method of determining similarity
between portions of gene expression profiles which includes 
processing a number of gene expression profiles using a 
computer with a similar sequences algorithm that is a time and
intensity invariant correlation function to obtain a data set
of gene expression pairs and a match fraction for each pair, 
the improvement comprising the steps of: 

listing gene expression pairs in clusters by their match 
fractions;

removing a first gene from a cluster when another cluster has 
another gene with a higher match fraction with the first gene, 
unless the another gene requires a larger number of
subsequences to achieve similarity with the first gene; 
repeating the removing step until all genes are listed in only 
one cluster; 

providing output of the listing of clusters of gene 
expression profiles. 